Are Multiple Reference Translations Necessary? Investigating the Value of Paraphrased Reference Translations in Parameter Optimization

Authors

  • Nitin Madnani
  • Philip Resnik
  • Bonnie J. Dorr
  • Richard Schwartz
Abstract

Most state-of-the-art statistical machine translation systems use log-linear models, which are defined in terms of hypothesis features and weights for those features. It is standard to tune the feature weights in order to maximize a translation quality metric, using held-out test sentences and their corresponding reference translations. However, obtaining reference translations is expensive. In our earlier work (Madnani et al., 2007), we introduced a new full-sentence paraphrase technique, based on English-to-English decoding with an MT system, and demonstrated that the resulting paraphrases can be used to halve the number of human reference translations needed. In this paper, we take the idea a step further, asking how far it is possible to get with just a single good reference translation for each item in the development set. Our analysis suggests that it is necessary to invest in four or more human translations in order to significantly improve on a single translation augmented by monolingual paraphrases.
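As a rough illustration of the log-linear scoring mentioned in the abstract, the sketch below scores each hypothesis as a weighted sum of its feature values and picks the highest-scoring one. The feature names (`lm`, `tm`, `length`), the toy values, and the helper names `loglinear_score` and `best_hypothesis` are assumptions made up for this example; they are not taken from the paper, and the tuning procedure itself is not shown.

```python
from typing import Dict, List


def loglinear_score(features: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted sum of feature values: sum over i of weight_i * feature_i."""
    return sum(weights.get(name, 0.0) * value for name, value in features.items())


def best_hypothesis(nbest: List[dict], weights: Dict[str, float]) -> dict:
    """Return the hypothesis with the highest log-linear score."""
    return max(nbest, key=lambda hyp: loglinear_score(hyp["features"], weights))


# Toy n-best list with hypothetical feature values (log-probabilities and length).
nbest = [
    {"text": "the house is small", "features": {"lm": -4.0, "tm": -2.0, "length": 4.0}},
    {"text": "the home is little", "features": {"lm": -6.0, "tm": -1.0, "length": 4.0}},
]

# Two hypothetical weight settings: one favors the language model,
# the other favors the translation model.
weights_a = {"lm": 1.0, "tm": 0.5, "length": 0.1}
weights_b = {"lm": 0.1, "tm": 2.0, "length": 0.1}

choice_a = best_hypothesis(nbest, weights_a)["text"]
choice_b = best_hypothesis(nbest, weights_b)["text"]
```

With these toy numbers the two weight settings select different hypotheses, which is why the weights are tuned against reference translations: the quality metric computed on the selected outputs depends on the weights.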

Similar resources

Systematic Comparison of Professional and Crowdsourced Reference Translations for Machine Translation

We present a systematic study of the effect of crowdsourced translations on Machine Translation performance. We compare Machine Translation systems trained on the same data but with translations obtained using Amazon’s Mechanical Turk vs. professional translations, and show that the same performance is obtained from Mechanical Turk translations at 1/5th the cost. We also show that adding a Mech...

Trajectory Optimization for a Multistage Launch Vehicle Using Nonlinear Programming

This work is an example of application of nonlinear programming to a problem of three-dimensional trajectory optimization for multistage launch vehicles for geostationary orbit missions. The main objective is to minimize fuel consumption or equivalently to maximize the payload. The launch vehicle considered here, Europa-II, consists of 5 thrust phases and 2 coast phases. Major parameters of the...

The Circle of Meaning: from Translation to Paraphrasing and Back

Title of dissertation: THE CIRCLE OF MEANING: FROM TRANSLATION TO PARAPHRASING AND BACK Nitin Madnani, Doctor of Philosophy, 2010 Dissertation directed by: Professor Bonnie Dorr Department of Computer Science The preservation of meaning between inputs and outputs is perhaps the most ambitious and, often, the most elusive goal of systems that attempt to process natural language. Nowhere is this ...

Using Paraphrases for Parameter Tuning in Statistical Machine Translation

Most state-of-the-art statistical machine translation systems use log-linear models, which are defined in terms of hypothesis features and weights for those features. It is standard to tune the feature weights in order to maximize a translation quality metric, using held-out test sentences and their corresponding reference translations. However, obtaining reference translations is expensive. In...

A Sound Symbolic Study of Translation of Onomatopoeia in Children's Literature: The Case of "Tintin"

As onomatopoeic words or expressions are attractive, the users of languages in the fields of religion, literature, music, education, linguistics, trade, and so forth wish to utilize them in their utterances. They are more effective and imaginative than the simple words. Onomatopoeic words or expressions attach us to the real nature and to our inner senses. This study aims at familiarity with on...

Publication date: 2008